upwork.com | 2026-05-20
European Intercity Transport Data Pipeline
Client: UKR, member since 2024-07-18
Price: ****
Problem: Lack of a consolidated, real-time schedule and pricing dataset for multiple European bus and rail carriers.
Existing: GTFS reference files
Specifications:
[Target] European bus and railway carriers (RegioJet, České dráhy, MAV, Trenitalia, Omio, Trainline)
[Target] Intercity routes between major European cities
[Method] Layer 1: Full initial extraction (30-day window)
[Method] Layer 2: Periodic schedule synchronization
[Method] Layer 3: On-demand real-time availability and price verification
[Stack] Python
[Stack] Modular carrier-specific scrapers
[Format] GTFS-compliant tables (stops, routes, trips)
[Format] stop_id, stop_name, city, country, lat, lon
[Format] route_id, operator, transport_type, origin_stop_id, destination_stop_id
[Format] trip_id, route_id, departure_datetime, arrival_datetime, duration_minutes, is_overnight
[Security] Anti-bot bypass mechanisms
[UI/UX] Response time < 1 second for Layer 3 requests
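
The [Format] fields above could be modeled as follows. This is a minimal sketch, not the client's schema: the class names, the make_trip helper, and the reading of is_overnight as "arrival falls on a later calendar date than departure" are assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Stop:
    stop_id: str
    stop_name: str
    city: str
    country: str
    lat: float
    lon: float


@dataclass(frozen=True)
class Route:
    route_id: str
    operator: str
    transport_type: str  # assumed values such as "bus" or "rail"
    origin_stop_id: str
    destination_stop_id: str


@dataclass(frozen=True)
class Trip:
    trip_id: str
    route_id: str
    departure_datetime: datetime
    arrival_datetime: datetime
    duration_minutes: int
    is_overnight: bool


def make_trip(trip_id: str, route_id: str,
              departure: datetime, arrival: datetime) -> Trip:
    """Hypothetical helper: derive duration_minutes and is_overnight."""
    if arrival <= departure:
        raise ValueError("arrival must be later than departure")
    minutes = int((arrival - departure).total_seconds() // 60)
    # Assumption: "overnight" means the trip crosses a calendar-date boundary.
    overnight = arrival.date() > departure.date()
    return Trip(trip_id, route_id, departure, arrival, minutes, overnight)
```

For example, make_trip("t1", "r1", datetime(2026, 6, 1, 22, 30), datetime(2026, 6, 2, 6, 15)) yields duration_minutes of 465 and is_overnight set to True. The sketch uses naive datetimes; routes that cross time zones would need timezone-aware values.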
Workflow:
1. Initial bulk extraction of 30-day schedules per carrier.
2. Population of internal routing graph database.
3. Implementation of recurring update cycles for schedule freshness.
4. Deployment of a real-time API/scraper for trip validation and pricing.
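
One way the workflow could be laid out with modular carrier-specific scrapers is sketched below. Everything named here is hypothetical: the CarrierScraper interface, the FakeCarrier stand-in with canned data, the stop IDs "PRG" and "VIE", and the price_eur field are illustrative assumptions, and an in-memory adjacency list stands in for the routing graph database.

```python
from abc import ABC, abstractmethod
from collections import defaultdict
from datetime import date, datetime, timedelta


class CarrierScraper(ABC):
    """Hypothetical interface; one subclass per carrier."""

    @abstractmethod
    def fetch_schedule(self, start: date, days: int) -> list:
        """Layers 1 and 2: return trip rows for the given window."""

    @abstractmethod
    def verify_trip(self, trip_id: str) -> dict:
        """Layer 3: return current availability and price for one trip."""


class FakeCarrier(CarrierScraper):
    """Stand-in with canned data so the sketch runs without network access."""

    def fetch_schedule(self, start, days):
        dep = datetime.combine(start, datetime.min.time()) + timedelta(hours=8)
        return [{
            "trip_id": "T1",
            "origin_stop_id": "PRG",
            "destination_stop_id": "VIE",
            "departure_datetime": dep,
            "arrival_datetime": dep + timedelta(hours=4),
        }]

    def verify_trip(self, trip_id):
        return {"trip_id": trip_id, "available": True, "price_eur": 19.0}


def bulk_extract(scrapers, start, days=30):
    """Workflow step 1: pull the full window from every carrier."""
    trips = []
    for scraper in scrapers:
        trips.extend(scraper.fetch_schedule(start, days))
    return trips


def build_graph(trips):
    """Workflow step 2: adjacency list keyed by origin stop."""
    graph = defaultdict(list)
    for trip in trips:
        graph[trip["origin_stop_id"]].append(trip)
    return graph


def direct_trips(graph, origin, destination):
    """Look up direct connections between two stops."""
    return [t for t in graph.get(origin, [])
            if t["destination_stop_id"] == destination]
```

Step 3 would re-run bulk_extract on a schedule and rebuild or patch the graph, while step 4 would call verify_trip only for the trips a user actually selects. Keeping each carrier behind the same interface lets one scraper be replaced when its site changes without touching the rest of the pipeline.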